The Effects of Lexical Resource Quality on Preference Violation Detection

نویسندگان

  • Jesse Dunietz
  • Lori S. Levin
  • Jaime G. Carbonell
چکیده

Lexical resources such as WordNet and VerbNet are widely used in a multitude of NLP tasks, as are annotated corpora such as treebanks. Often, the resources are used as-is, without question or examination. This practice risks missing significant performance gains and even entire techniques. This paper addresses the importance of resource quality through the lens of a challenging NLP task: detecting selectional preference violations. We present DAVID, a simple, lexical resource-based preference violation detector. With asis lexical resources, DAVID achieves an F1-measure of just 28.27%. When the resource entries and parser outputs for a small sample are corrected, however, the F1-measure on that sample jumps from 40% to 61.54%, and performance on other examples rises, suggesting that the algorithm becomes practical given refined resources. More broadly, this paper shows that resource quality matters tremendously, sometimes even more than algorithmic improvements.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Metaphor Detection using Large-Scale Lexical Resources and Conventional Metaphor Extraction

The paper presents an experimental algorithm to detect conventionalized metaphors implicit in the lexical data in a resource like WordNet, where metaphors are coded into the senses and so would never be detected by any algorithm based on the violation of preferences, since there would always be a constraint satisfied by such senses. We report an implementation of this algorithm, which was imple...

متن کامل

Time Preference and its Effects on Intertemporal

Time preference has a peculiar role in determining the level of economic activities. Time preferenceis the most important origin of interest rate. In this paper we study the founders and defenders'viewpoints about time preference and then we try to criticize them. It seems that discounting futureutilities is resulting from irrationality and it is ethically indefensible too. From mathematical as...

متن کامل

The Impact of Task Complexity along Single Task Dimension on EFL Iranian Learners' Written Production: Lexical complexity

Based on Robinson’s Cognition Hypothesis, this study explored the effects of task complexity on the lexical complexity of Iranian EFL students’ argumentative writing.This study was designed to explore the manipulation of cognitive task complexity along +/-single task dimension (a resource dispersing dimension in Robinson’s triadic framework) on Iranian EFL learners’ production in term of lexica...

متن کامل

Application of Fuzzy Technique for Order-Preference by Similarity to Ideal Solution (FTOPSIS) to Prioritize Water Resource Development Economic Scenarios in Pishin Catchment

Water is a basic demand of sustainable development in most regions of the world. The non-uniform temporal and spatial distribution of water resources will lead to water shortage in arid and semi-arid areas. Pishin catchment is one of the most important catchments in South-East Iran. The basin had been faced with consecutive droughts in recent years. On the other hand, water resources developmen...

متن کامل

Institutional Quality and Curse Resources: An Experimental Study on OPEC Countries

This paper is to study the resource curse applying annual data from 2002 to 2016 for the Organization of the Petroleum Exporting Countries (OPEC) members i.e. Algeria, Iran, Kuwait, Nigeria, Qatar, Saudi Arabia, United Arab Emirates and Venezuela. For this purpose, there were concerned the interactions role of resource abundance and institution quality, and their marginal effect of the countrie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013